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Abstract 

For purposes of Value-at-Risk estimation, we consider several multivariate fam- 
ilies of heavy-tailed distributions, which can be seen as multidimensional versions 
of Paretian stable and Student's t distributions allowing different marginals to have 
different indices of tail thickness. After a discussion of relevant estimation and 
simulation issues, we conduct a backtesting study on a set of portfolios containing 
derivative instruments, using historical US stock price data. 
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1 Introduction 

The purpose of this paper is to assess the performance of some classes of multivariate 
laws with heavy tails in the estimation of Value-at-Risk for nonlinear portfolios. The 
inadequacy of Gaussian laws, in one or several dimensions, to model the distribution of 
risk factors, especially in view of applications to risk modeling, is well-documented in 
the empirical literature (see e.g. [H [10] and references therein). Here we concentrate 
on models for risk factors that are multivariate extensions of the classical a-stable and 
Student's t distributions. In particular, we consider multivariate laws whose marginals 
may have different indices of tail thickness, and/or whose structures allow for tail de- 
pendence (i.e., roughly speaking, extreme movements of several risk factors may happen 
together). 

Let us briefly recall how VaR is usually estimated for nonlinear portfolios (i.e. for 
portfolios containing derivative instruments), and what kind of improvements have been 
proposed. In the simplest setting, one uses a linear approximations of losses with nor- 
mally distributed risk factors: denoting by L the loss over a certain time period, one sets 
L ~ {A,X), where X ~ N{m,Q) is a d-dimensional vector of Gaussian risk factors, A 
is an element of M , and (•, •) stands for the usual scalar product of two vectors. Then 
(A,X) follows a one-dimensional Gaussian distribution with mean {A,m) and variance 
{QA,A), so that (an approximation of) VaR can be obtained immediately. However, it 
is clear that such a scheme suffers from two major weaknesses: the linear approximation 
is inaccurate, as the payoff function of derivatives is usually highly non-linear, and the 
hypothesis that random factors are Gaussian is often inappropriate, as briefly mentioned 
above (the literature on this issue is very rich - see e.g. [5l \T2\ I13j . to mention just a 
few classical references). Among the many improvements that have been suggested in 
literature, some focus on a better modeling of the nonlinear relation between L and X 
(e.g. by using quadratic approximations of the type L ~ {A,X) + {TX,X)), but still 
assuming X Gaussian (see e.g. [in])j while others introduce alternative distributions of 
portfolio losses, often just in the univariate setting (see e.g. [151 121] )• To the best of 



our knowledge, however, there are only a small number of studies devoted to models 
that take into account both non-linearities and non-normality in a multivariate setting: 
Duffie and Pan [11] and Glasserman et al. [16] adopt the quadratic approximation and 
non-Gaussian risk factors. In particular, risk factors include a jump component in the 
first work, and are modeled by multivariate t distributions (or a modification thereof) 
in the latter. However, both works are devoted to different issues (analytic approxima- 
tions and efficient simulation techniques, respectively), therefore they do not address 
the statistical issues related to the implementation of their models, and do not test their 
empirical performance on real data. 

Our contributions are the following: we introduce a stable-like model for risk factors 
obtained by multivariate subordination of a Gaussian law on M'^ (see ^, such that each 
marginal (i.e. each risk factor) can have a different index of tail thickness. We con- 
struct estimators for the parameters of this distribution and we study their asymptotic 
behavior. An analogous program is carried out for a multivariate t-like law (see ^. 
In §^1H5] we consider models of risk factors obtained by "deforming" the marginals of 
symmetric sub-Gaussian a-stable and multivariate t-distributed random vectors, respec- 
tively. Equivalently, using the language of copulas, we consider models of risk factors 
with symmetric a-stable resp. t-distributed marginals (possibly with different parame- 
ters) on which a sub-Gaussian a-stable resp. multivariate t copula is superimposed. In 
^ we provide an extensive back-testing study of all parametric families of distributions 
using real data, on portfolios containing both standard and exotic options, relying both 
on full revaluation of the portfolio value and on its quadratic approximation. 

We conclude this introduction with a few words about notation: given a (possibly 
random) d-dimensional vector X, we shall denote its i-th component, 1 < i < d, by Xi. 
The inverse of an invertible function / will be denoted by /^. The Gaussian measure 
with mean m and covariance matrix Q will be denoted N{m, Q). We shall write X ^ C, 
with X a random variable and C a probability measure, to mean that the law of X is 
C The a-stable measure on the real line with index a, skewness /?, scale a and location 
fj, is denoted by Sa{cr, [3, fi), and we always use the parametrization adopted in [23] . 
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2 Multivariate stable-like risk factors 

2.1 Description of the model 

Given a probability space (il, J^, P), let X : i7 — t- M be a random vector of risk factors 
such that 

X = A^I'^G, (1) 

where A = diag(^i, . . . , A^) is a diagonal random matrix with independent entries, 

^.-'S,^/2((cos^)'/°M,0) Vz = l,...,d, (2) 

and G is a M -valued Gaussian random vector, independent of A, with mean zero and 
covariance matrix Q. In ([2]) we assume Oj G]1, 2[ for alH = 1, . . . , d. 

Note that ([1]) and ([2]) imply that, for each i = 1, . . . ,d, the i-th marginal of X has 
distribution S^. (iTj, 0,0), where E[G^] = 2af. In particular, risk factors are allowed to 
have different indices of tail thickness Oj, and they are dependent through the Gaussian 
component G. 

2.2 Estimation 

Let X{t), t = 1, . . . ,n be independent samples from the distribution of X. For p < 
mini<j<(^(ai)/2, define the (improper) sample p-th moment as 



Mp(n)=n-i5^X,(t)<P>X,(i)<^\ 



t=i 



where X^^' := |X|^sgn(X). Note that, by Cauchy-Schwarz' inequahty, we have 



^XiXj\P < (E|Xi|2p)^/^(E|Xj|2p)^/^ < oo, 



thus also, by Kolmogorov's strong law of large numbers. 



hm Mp{n) = E{XiXj)'^P^ a.s.. 



Since the random matrix A and the random vector G are independent, one has 



E{XiXj)'^P^ = (EAP/^j(KAf^jE{GiGj)'^P\ 



where (see e.g. [23l p. 18]) 

J. ap/2 2P/^T{l-p/ai) ( 2<^i'^\'^f vraA^ pvr 

EAy = — r35 /n , . 9 — ;- 1 + tan^ — — M cos — — cos — - 



G. 



oii,p- 



The constant Ga,p can be computed explicitly, recalling that 



/ «-P/2-i sin^udu = -2P/2-1 cos -^ T{-p/2). 



Let us now define the function 



fp--]-hl[ 



q^ 



niZiZ2)'^^\ 



where Zi, Z2 are jointly normal random variables with covariance matrix 



1 q 

q 1 



For any given p < niinj(aj)/2, matching the theoretical signed p-th moments of XiXj 
with their sample counterparts, we obtain the following estimator for the matrix Q: 



n 



-^T.'i=iXi{tY''^x,{tYpy 



Qij = 2aiajfp nplp'pr r ' h J = 'i-, ■ ■ ■ ,d. 

If {(7.j}i and {aj}j are not known a priori, but we rather have only consistent estimators 
\pin\i and {ajnli; respectively, one can easily deduce (by several applications of the 
continuous mapping theorem), that the estimator of Q obtained replacing a^ with qj„ 
and Gi with din in the above expression is still consistent. 

Remark 1. (i) As far as the estimation of the covariance matrix Q is concerned, the 
heavy tailed assumption does not imply any extra computational burden. 

(ii) For our purposes, it is enough to choose p = 1/2, as we always assume Oj > 1 
for all i = 1, . . . , d (as is well-known, this is equivalent to assuming that all returns have 
finite mean). 

(iii) Unfortunately we are not aware of an explicit expression for the function q i— )■ 
fp{q)- However, it can be expressed as an integral with respect to a Gaussian measure 



inM^: 



Up) = r / (xiX2)<^>e-5(Q '^'^)dx 



1 / . , /„\ -^-^T-(x'i—2ax^xo+x'i. 



27rv'l - g2 



(^,^2)<^>e"^a^^^^-'^^^^^+^^^ dxr dx^ (3) 



which can be computed by numerical integration with essentially any accuracy. Figure 
[T] plots the function /i^ on the interval [0, 1[. 

[Figure 1 about here.] 

Let us consider a simplified case: d = 2, cri = cr2 = l/v2, and qi, 02 given. The 
assumption d = 2 is harmless, as in any case the method works componentwise. The 
case of unknown aj and Uj can be dealt with replacing them with their corresponding 
consistent estimators, as discussed above. 



Let us define 






We first prove tlie following lemma: 

Lemma 2. The function /p :] — 1, 1[— )• M zs bounded, continuously differentiable, concave 
increasing on ] — 1,0[ and convex increasing on ]0, 1[. 

Proof. Boundedness follows by concavity of the function x i— >• \x\p for p < 1 and Jensen's 
inequality, that yield 

\fp{q)\ = \&zf^Z^^\ < E\ZiZ2\P < mZiZ2\f < 1, 

where the last inequality follows by Cauchy-Schwarz' inequality and KZf = EZl = 1. 
Continuous differentiability w.r.t. q is immediate by inspection of Q. Differentiating ([3]) 
w.r.t. q twice, one gets (after some cumbersome but elementary calculations) /!((/) > 
for all g g] - 1, 1[, and f^{q) < for g < 0, f^{0) = 0, f^{q) > for q > 0. The lemma 
is thus proved. D 

It is easy to prove that g^ is strongly consistent, i.e. that g„ — ?■ g a.s. as n — )• oo. In 
fact, as above, since p < (minjaj)/2, by Kolmogorov's strong law of large numbers one 
has 

fpiqn) 



n-£;UX.(,)«X.(0'^' ^ ,,^_^.^,„ _ ^^(,, ^,^,_ 



'^ai,p^a2,p 

from which we can conclude thanks to the continuous mapping theorem and the conti- 
nuity of /^. 

We are now going to prove that the estimator dH) is asymptotically normal, under a 
more stringent assumption on the chosen value of p. Let us define the function 



gp-.R^ 



X 1-^ 



It is clear that the estimator Q can be defined as the solution of the equation 

1 " 

^nQp ■■= -Y,9p{X{k)) = E,gp{X) =: fp{q), (5) 

fc=i 
where P„ stands for the (averaged) empirical measure of the sample ^(1), • • • , X{n), i.e. 

1 " 

k=l 

Proposition 3. If p < minj(Qj)/4, then qn is asymptotically normal and satisfies 

V^iQn -q)^ N(^0,f;{qr^E,[gl{X)] - f^{q))), (6) 

where "=4> " stands for convergence in law. 

Proof. We have proved in lemma [2] that fp{q) = E,qgp{X) is a bijection on the open set 
] — 1, 1[, it is continuously differentiable on its domain, and fp{x) ^ for all x g] — 1, 1[. 
Moreover, as it follows from (j3|) and ([5]), one can write 

^{q^ -q) = V^[f^{Fn9p) - f^{Eggp{X))^ . (7) 

We have, by the strong law of large numbers, Pnfi'p — )• lEg^p a.s. as n — )• oo. Recalling 
that by hypothesis p < min,(Qj)/4, it follows that Kqgp{X) < oo, hence, by the central 
limit theorem, 

V^iFngp - E,gp{X)) => N{0,E,gliX) - f^q)). 

An application of the delta method, taking into account the inverse function theorem, 
now yields the result. D 

A shortcoming of the asymptotic confidence interval implied by the above proposition 
is that the asymptotic variance depends on the parameter to be estimated itself. One 
can overcome this problem by a variance stabilizing transformation: let us define the 



function 7 :] — 1, 1[- 



'(p> y{p)\'^ 



7p(g) = EggpiX) - {Eggp{X)) = 



and 

lp_ 

Then, again by the delta method, we obtain 



^ f'{y) 

'^p(^) = / 1/2. . ^y- 



V^M^n) - Mq)) => iv(0, V'piqfj^) = NiO, 1), 
and a corresponding asymptotic confidence interval for q as 

This asymptotic normality result for ipp{qn) would of course be better if we had an ex- 
plicity expression for ipp, which instead needs to be approximated numerically. However, 
since both fp and 7p are smooth functions (i.e. at least C^), constructing a numerical 
approximation of ipp is a rather simple task. 

Remark 4. The proof of the previous proposition, as well as the construction of the 
variance-stabilizing transformation, depend crucially on the assumption that ci, (T2, ai, 
02 are known (cf. the assumptions stated immediately before formula dl])). Therefore, 
for application purposes, the result should be applied with care, and would do only 
as a "first approximation". Nonetheless, it is also quite common in the estimation 
of multivariate models to separate the estimation of the parameters for the marginals 
from the estimation of the dependence structure. It would certainly be interesting to 
obtain asymptotic confidence intervals treating simultaneously Uj, Oj, i = 1,2, and q as 
parameters to be estimated. 



2.3 Simulation 

In view of the results of the previous subsection, we assume that the covariance matrix 
Q is known, hence, with a shght but harmless abuse of notation, we shall write Q instead 
ofQ. 

Random vectors from the distribution of X can be simulated by the following simple 
algorithm: 

(i) generate d independent random variables Z^ ~ iV(0, 1), z = 1, . . . , d, and form the 
random vector Z = (Zi, . . . , Z^) ~ iV(0, /), so that Q^I'^Z ~ iV(0, Q); 

(ii) independently from Z, generate d independent random variables from the distri- 
bution of Ai^ i = 1, . . . , d, as defined in ([2]); 

(iii) setting A = diag(^i, . . . , A^), one has that A'^'''^Q^''^Z is a sample from the d- 
dimensional law of X 

Note that the only computational overhead with respect to the simulation of a Gaus- 
sian vector is the simulation of the stable subordinators, for which nonetheless efficient 
algorithms are available (see e.g. [23]). 

2.4 Extensions 

Let us remark that the model ([1]) for the vector of risk factors can be extended to allow 
for asymmetries. In particular, setting 

X = X + B = A^/'^G + B, 

where i? is a random vector, independent of X, with independent components Bi dis- 
tributed according to the law ScuicBi, 1, 0), we have that the i-th marginal of the vector 
X has distribution 5^; (o"i , /3i , 0) , where 

CTi — 1,0- . -^ 0-g . ; , Pi— ai , ai ■ 

"i "I" ^Bi 



10 



One can then estimate the parameters a^, cjj and /3j fitting (e.g. by maximum hkehhood 
estimation) a general Paretian stable law to observed data, and obtain (corresponding 
estimates of the) values of Uj , 0"^^ : 



a. = (l-A)^/"'a,, aB^=pl'''^a,. 



Note also that, since we assume Oj > 1 for all z = 1, . . . , d, we have EXj = for all i. 
Finally, an estimate of Q can be obtained by a rather involved modification of method of 
fractional moments introduced in ^2.2l above. In particular, assuming d = 2 for simplicity 
and using the notation of ^2.21 let us set 

fp{p;aua2,ci,C2) = nciAiG\/^ + Bi)^pHciAiG\/^ + B2)^P\ 

where ci, C2 are positive constants and Bi, B2 are independent with Bi ~ 5q,. (1,1,0), 
i = 1,2. Then a moment estimator for p can be constructed in a rather obvious way by 
matching theoretical and sample fractional moments using the function /p(-; ai,a2,ci, C2), 
treating Oj, Cj, i = 1, 2, as "known" (by an immediate scaling argument, the constants ci, 
C2 are uniquely determined by ai and cjj, i = 1,2). This method performs unfortunately 
much slower in comparison to the corresponding procedure in the symmetric case, as 
the function fp cannot be computed off-line as in ^2.21 

Moreover, model ([1]) does not allow for tail dependence among different risk factors. 
As a remedy, one may use the series representation of stable subordinators (see e.g. |23j). 
setting 

k=0 

where (7fc)fc>o is a (fixed) sequence of independent standard Gamma random variables. 
The analysis of this model, however, is considerably more involved, and we plan to 
elaborate on these issues in a future work. 



11 



3 Multivariate t-like risk factors 

3.1 Description of the model 

On a probability space (fi, J-", P), let us consider a d-dimensional random vector of risk 

factors X such that 

Xk= 3^ , fc = l,...,d, (8) 

^/Vk/i^k 

where G ~ A^(0, Q) and Vi, . . . ,Vci are independent one-dimensional x^-distributed ran- 
dom variables with parameters ui^.-.^Ud, respectively. We also assume that G and 
(^1, . . . , Vdj are independent. Then, for each k = 1, . . . , d, the /c-th marginal of X is 
distributed according to a Student's t distribution with parameter !/,%., multiplied by 
(Tfc := (EG^)^'^. In particular, as in the case of the previous section, risk factors may 
have different indices of tail thickness (measured by z^^), and their dependence comes 
from the Gaussian component G. 

3.2 Estimation 

Assuming for the time being z^^, /c = 1, . . . , d, to be known, let us estimate the covariance 
matrix Q by the method of moments. We shall assume from now on that Uk > 2 for all 
k, which implies in particular that IEX| < oo for all k. One has 

KXhXk = VUK^EGhGkEV^^^^EVj;^^^ 
= Qhk V^J^kEV^^'^ EV-^'^ 



for all /i 7^ fc, and 



EXl = Qkk ^k EV^^ = al Vk EV7^ 



Denoting, for simplicity, a random variable with 'x^iv^ distribution by V ^ the density of 
V is given by 



12 



so that 



f'OO -I fOO 

Jo 2-/^T{u/2) Jo 

T{u/2 - 1/2) 



\/2r(z./2) 
and, similarly, 



1 f°° u/9 9 - , ir(z^/2-l) 1 

Jo 2 T{v2) V- 



2 V{vl2) v-2 

Here we have used the definition of Gamma function. 



oo 



r(z) = / e-'e-'dt, z>0, 
Jo 

and its "factorial" property T{z + 1) = zT{z). The above calculations yield 

_ 2 r(^,/2) r(^fc/2) 

y/^hT^k ^{Vh/2 - 1) T{Vk/2 - 1) 



and 



Qkk = cfk = '^^k ■ 

^k 



We have thus obtained the following moment estimator for Q: 

. _ 2 r(z.,/2) rK/2) 1" 

y/vhPk rK/2 - 1) r(l/fc/2 -I) nj^^ 

and 

— 9 1 " 

Qfcfc = o"fc = > -'^fc(i) • 

i^ir. n ^-^ 
'^ t=i 

Note that, for each /c, z^^ can be estimate by one-dimensional maximum likelihood on 

the /s-th marginal, thus obtaining a family of consistent estimators z>fc, k = l,...,(i. 

Therefore, the corresponding estimator of Q obtained by substituting in the previous 

expressions each f^ with 1)^, for each k, is still consistent. 



13 



We can now prove that Qhk is asymptotically normal. For compactness of notation, 
we shall set 



a:= 



V2 r(z//2) 



and we shall consider only the case h ^ k. The asymptotic normality of the estimators 
(Tfc can be established analogously (see also ^3.4p . 



Proposition 5. Let d = 2, 



Q 



1 q 
q 1 



and 



Then one has 



where 



n 

qn:=C,,C,,-y^Xi{t)X2{t). 



Vn{qn-q) ^ N{0,v^^^^2{q)), 






Proof. We have Var q^ = Eg^ — q'^ and 



Eg2 = c^^c^^ EXf X| = uiU2C^,^Cl EGlGl E^f ^ EV^-' 



^^^^EG?Gi, 
jyi-21/2-2 ^ ^ 



where we have used the identity EF -^ = (i/ — 2) '^. To compute EGfG|, let us write 



G2 



V^ 



^1 

^2 



with (Zi,Z2) ~ iV(0,/). This yields, recalling that the fourth moment of a standard 
Gaussian measure is equal to 3, 



EGlGl 



q'^EZf + l-q^ = 2q^ + l, 



(9) 
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thus also 



Varg„ = -{2q +l)-q , 



i/i - 2 1^2 - 2 
whence the result follows by the central limit theorem. D 

Remark 6. One could derive from this asymptotic normality result an asymptotic con- 
fidence interval using a variance stabilizing transformation, as shown in the previous 
section. The same caveats discussed in Remark U] apply of course in this case as well. 

3.3 Simulation 

Generating random vectors from the distribution of a multivariate t-like distribution is 
a straightforward modification of the procedure outlined in ^2.31 above. 

3.4 Extensions 

Since marginals of the random vector X follow a univariate t distribution, they are 
symmetric. In order to allow for asymmetric marginals, one may posit X = (Xi, . . . , X^), 

Afc := Xk - % := -rjk, k=l,...,d, 

^/Vk/vk 

where G ^ N{0,Q), and m = (mi, . . . ,7?!^), rj = [rji, . . . ,r]d) G W^- Then for the k- 
th marginal one has that Xk + ??fc follows a noncentral t-distribution. The reason for 
subtracting the vector ij from X is that EX ^ 0, unless m = 0, and it is common to 
assume that risk factors have mean zero. Unfortunately the density of the noncentral 
t law is expressed in terms of a definite integral depending on parameters (see e.g. 
|25j ) , hence maximum likelihood estimation on the marginals becomes numerically quite 
involved. On the other hand, assuming z^^ > 4 for all k, one can use the method of 
moments to construct estimators for i^ = (z^i, . . . , f^), m, i] and Q. In fact, considering 
k fixed and equal to 1 for the sake of simplicity, the constraint EXi = translates into 
the relation 

^lR'T/-l/2 ^ r((t/i-l)/2) 

\j IV [yxl I) 
15 



Since we need to estimate four parameters, we need other three equations. These can be 
obtained by matching the second, third, and fourth sample moments to the corresponding 
theoretical moments, which are known in closed form (see e.g. |17j). 

We should also observe that in general it is not necessary to match moments of 
integer order to obtain consistent and asymptotically normal estimators. One may also 
use fractional moments, as it has been done in the previous section, thus relaxing the 
assumptions on the parameters Vk- For instance, let X be as in ([8]), (i = 2, Q = [„ i], 
and consider the problem of estimating q. Setting gp{xi^X2) = xf X2 , we can write 



Egp{X) = {uiU2f/^'EV~^^^KV~^^^E{GiG2 



,(p> 



Note that E(GiG2)'^' = fp{q), where fp is the function introduced and studied in ^2.2 
and, in analogy to a calculation already encountered in this section, 



^p/2 _ T{iyk/2-p/2) 
2P/^r{uk/2) 



m~''' = ::xz.:LV ^ ^ = ^2. (lo) 



This relation can be used as a basis for a moment estimator, as in ^2.21 Choosing p 
small enough, one does not need to assume i^^ > 2. 

4 Multivariate meta-stable risk factors 

4.1 Description of the model 

On a probability space (il,J^, P), let G ~ N{0,Q) be a d-dimensional random vector 
with det Q 7^ 0, and 



A ~ 5'„„ /o f cos 



VTQo N 2/ao 



)'/"M,o), 



with A and G independent. The random vector X' := A^''^G is then symmetric a-stable 
with characteristic function 
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in particular X' has an elliptically contoured distribution (see e.g. |23j for the properties 
of so-called sub-Gaussian a-stable laws, and [6] for elliptically contoured distributions). 
As is well-known, the marginals of X' are Q-stable with index ao, i.e. they have all the 
same index of tail thickness (as measured by ao)- In order to build a model allowing 
for different tail behavior along different coordinates, one may set X = f{X), with / 
a deterministic (nonlinear) injective function, for instance to "deform" the marginals 
of X' (a large part of the literature on the applications of copulas to risk management 
is centered around this simple idea). A common procedure (see e.g. [TU] in a slightly 
different context) is to define a random vector X = {Xi, . . . , X^) as 

Xk = akF^^{Fa,{X'„)), k = l,...,d, (11) 

with (Tfc, k = 1, . . . ,d positive scaling constants, a^ G]1, 2] for all k = 1, . . . ,d, and the 
diagonal elements of Q are normalized to one. Here and throughout this section F^, 
a g]0,2], stands for the one-dimensional distribution function of a standard symmetric 
stable law with index a. It is clear that the law of the k-th. marginal of X is symmetric 
a-stable with index a^. Using the terminology introduced in [T3], the random vector X 
has a meta-elliptical distribution, which we call meta-stable. Note that X' , hence also 
X, are expected to have nontrivial tail dependence between any two marginals because 
of the common factor A. 

4.2 Estimation 

One can estimate the parameters of a meta-stable distribution thanks to the following 

remarkable relation (see [14^ Thm. 3.1] and also [21]): let X = {Xi, . . . , X^) be a random 

vector with meta-elliptical distribution, and denote the Kendall's r of Xi and Xj, i,j = 

1, ... ,d, hy Tij. Then we have 

2 

l~ij ^ ctrCSlIl ^ij 1 
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which immediately yields the estimator 



TT 



It is worth recalling that Kendall's tau statistic is a [/-statistic of order 2 with a bounded 
kernel, therefore it is asymptotically normal (see e.g. [261 §12.1]). Unfortunately however 
there does not seem to be an explicit expression for the asymptotic variance, at least not 
(to the best of our knowledge) in the cases considered in this paper. One can also infer, 
by an application of the delta method, that the above estimator of Qij, i, j = 1, . . . ,d, 
is also asymptotically normal. 

Moreover, the parameters a^ and a^, k = 1, . . . ,d, can easily be estimated e.g. by 
maximum likelihood on the marginals of X. Finally, the parameter oq can be estimated 
as follows, where, in view of the above, we treat the parameters 6^, Vk and the matrix 
Q as known (in practice they will have to be replaced by their consistent estimators): 
defining the M*^- valued random vector U = ([/i, . . . , Ud) as 

Uk ■■= Fa^{Xk/(7k), k = l,...,d, 

since U^ = Fo,f^{X'f^) for all k = 1,. . . ,d, by the results in Appendix [Xjl we have that 
the law of U admits the density 



pu{ui,... ,Ud;ao) 



ULifcoiFtoiuk)) ' 



where fao is the density of S'a(,(l,0, 0), and h is the function defined in Appendix Rl 2. 
The parameter ao can then be estimated by maximum likelihood. 

4.3 Simulation 

By (jlip we have X^ = akF^{Faa{A^''^Gk), A; = 1, . . . , d, from which a simulation scheme 
completely analogous to that outlined in ^2.31 can be devised. From the computational 
point of view, the main problem is that there is no closed-form expression for the cumu- 
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lative distribution function and for the inverse distribution function of a one-dimensional 
stable law. However, there exist representations of them as integrals, which can be imple- 
mented numerically. This procedure can become computationally very expensive when 
simulating large samples. To reduce the simulation time, one can compute off-line Fa{x) 
and F^{x) for sufficiently many values of a and x, and replace the numerical integration 
by interpolation. Since both {x,a) i— )■ Fa{x) and {x,a) i— )■ F^{x) are smooth functions 
(at least for a > 1), interpolated values provide accurate approximations to numerical 
integrals. 

4.4 Extensions 

The model can easily be extended to accomodate asymmetric marginals: it is enough to 
set 

Xk = Ht{F^M^/^Gk)), k = l,...,d, 

where H^ is the cumulative distribution function of the general asymmetric centered a- 
stable law Sa,, (o"fc , /^fc j 0) . The estimation of this extended model is completely analogous 
to the symmetric case discussed above, with the only difference that the parameters /3fc, 
k = 1, . . . , d, will also have to be estimated. This can be accomplished again by maximum 
likelihood estimation on the marginals. 

5 Multivariate meta-i risk factors 

5.1 Description of the model 

On a probability space {Q,J-,¥), let G and y be a d-dimensional random vector with 
law A^(0, Q) and an independent one-dimensional random variable with x^ distribution 
with i/Q degrees of freedom, respectively. We shall call the law of the random vector 



X' = G/y^V/vQ a multivariate t distribution (with parameters uq and Q). There are 
other possible multivariate generalizations of Student's t distribution (see e.g. [E]), but 
we shall concentrate exclusively on this definition, which seems to be the most widely 
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used in financial applications. 

In complete analogy to the meta-stable model discussed in the previous section, 
the marginals of X have the same tail thickness (as measured by z/q), but one expects 
nontrivial tail dependence between any two marginals because of the common factor V . 
In order to allow for different tail behavior along different coordinates, one can proceed 
as in the previous section. In particular (see e.g. [E]), one may define the d-dimensional 
random vector X as 

Xk = 4F.t(^-o(^fe)), k = l,...,d, (12) 

where (5/j > for all k = l,...,(i, Fy denotes (here and throughout this section) the 
distribution function of a one-dimensional t law with v degrees of freedom, and Qkk = 1 
for all k = 1, . . . ,d. Then the k-th marginal is t distributed with i/^ degrees of freedom, 
thus overcoming the problem of having all marginals with the same tail thickness. The 
distribution of the random vector X belongs to the class of meta-elliptical distributions 
introduced in [T3]. In the latter reference the law of X is called meta-t, terminology 
which we have borrowed here. 

5.2 Estimation 

The estimation algorithm for the meta-i model is completely analogous to the one for the 
meta-stable model. In fact, since X, as just recalled, has a meta-elliptical distribution, 
the matrix Q can be estimated thanks to its relationship with Kendall's tau already 
mentioned in Subsection [3l2 above (cf. also [3 [21] about parameter estimation for the t 
copula). As in the previous Section, Kendall's tau statistics as well as the corresponding 
estimator of Qij, i,j = 1, . . . , d, are asymptotically normal. 

Similarly, the parameters 6^ and z^^, k = 1, . . . ,d, can easily be estimated by max- 
imum likelihood on the marginals of X. Finally, the parameter i'q can be estimated 
by maximum likelihood: treating, for the sake of simplicity, the parameters 5^, i^ki 
k = 1, . . . , d, and the matrix Q as known, let us define the d-dimensional random vector 
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U as 

Uk:=F,^{Xk/Si,), k=l,...,d. (13) 

Recalling the explicit expression for the density of a multivariate t (see e.g. [H]), (HZ]) 
and the result in Appendix [Xll imply that the law of U admits the density 

^^(-^'•••'-'^^-°)= (detQ)V2 ^^VkVw (^ + -o^(Q^V.)) ^ 



n(i+^o"'«i) 



1^0 + 1 
2 



fe=l 

where Ufc := F^{uk)- In practice, of course one needs to replace Uk, 5k and Q with the 
estimates obtained e.g. by the above methods. 

5.3 Simulation 

Random samples from the distribution of X can be generated by a rather straightfor- 
ward modification of the procedure outlined in Subsection 14.31 In fact, the distribution 
function of the univariate t distribution, as well as its inverse, are implemented in sev- 
eral software packages (such as Octave), even though they do not admit a closed- form 
representation]^ 

5.4 Extensions 

As in the meta-stable case, one can generalize meta-t laws to allow for skewed marginals 
replacing F^^, k = 1,. . . ,d, in (|13|) with the cumulative distribution functions of non- 
central t laws, in analogy to the case discussed in ^3. 



6 Estimation of Value-at-Risk by simulation 

We shall denote by L the loss of a portfolio depending on the vector of risk factors 
X. Recall that the Value-at-Risk (VaR) of a portfolio at confidence level /3 (usually 



^It might be better to say that they do, but in terms of hypergeometric functions. 
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/3 = 0.95 or /3 = 0.99) is simply the /3 quantile of the distribution of portfoHo losses. 
Since it is in general very difficult, if not impossible, to obtain analytically tractable 
expressions for the distribution function of the random variable L (even if the density 
function, or the characteristic function, of the vector X is known in closed form), one 
usually estimates quantiles of L by generating random samples from its distribution 
and computing the corresponding empirical quantiles. We shall exclusively deal with 
the so-called parametric (estimated) VaR, in the sense that we fit to observed data the 
parameters of a given family of distributions for the vector X of random factors, and we 
generate random samples from the law of X. In order to obtain a sample from the law 
of L we should know the functional relation between L and X. For a linear portfolio 
(roughly, a portfolio without derivative instruments), one simply has L = {w, X), where 
w G W^. In the more interesting case of a portfolio containing derivatives, one has 
L = f{X), where / : R — ?• M is a nonlinear function. Unless the derivatives in the 
portfolio are very simple, the function / may not admit a closed-form representation, or 
could just be obtained by nontrivial numerical procedures, that would have to be carried 
out for each random sample of X. For this reason one usually relies on approximations 
of the function / of the form 

L^fiO) + {f'{0),X) + l{nO)X,X), 

which is obviously motivated by the second-order Taylor expansion of the function M 3 
X I— )• f{x) around zero. The values of /'(O) and /"(O) are in general determined by 
the so-called greeks (in this case. Delta, Gamma and Theta) of the derivatives in the 
portfolio. Note that in the above approximation the possible dependence of / on time 
can be taken into account by including time in the set of risk factors. 

The analytic computation, or just approximation, of quantiles of quadratic forms in 
random vectors (other than Gaussian) is in general a very difficult task. Simulation is 
hence a viable alternative, as long as one can generate samples from the distribution of 
X. 
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We are going to perform a backtesting study on the four classes of parametric models 
for the distribution of risk factors introduced in Sections 2-5, to which we refer for the 
corresponding estimation and simulation procedures. Value-at-Risk is just estimated by 
empirical quantiles of random samples of L, either obtained by full revaluation, or by the 
above quadratic approximation. In particular, we do not focus on efficient simulation 
methods for quantile estimation, but we are rather interested on the relative performance 
of different distributional hypotheses for risk factors, when tested on real data. 

Let us also recall that all parametric families of multivariate laws that we fit to data 
are symmetric. A detailed comparison of the empirical performance of symmetric models 
and (some of) their asymmetric counterparts is outside the scope of the present paper, 
and it is left as an interesting question for future work. 

7 Empirical tests 

7.1 The data set 

We consider two portfolios of underlyings with quite different characteristics: portfolio 
A is more diversified, while portfolio B is strongly correlated. In particular, portfolio A 
is composed of two US stocks from each of four different industries, while portfolio B is 
composed of eight US stocks from a single industrjo. While portfolio A is, in some sense, 
more realistic (e.g. from the point of view of an investor aiming at holding a reasonably 
diversified portfolio), portfolio B is constructed as a "stress test" portfolio with high tail 
thickness and (potentially) high tail dependence. 

The raw price series are freely available on the Internet, and the returns are calculated 
as daily log-differenceqj. The data set covers the time period from 2-Jan-1991 through 
31-Dec-2008. 



^The selected stocks are Apple, Bank of America, Chevron, Citigroup, Conoco, Microsoft, Johnson 
and Johnson, and Pfitzer for portfolio A and American Express, Banco Santander, Bank of America, 
Barclays, Citigroup, JP Morgan Chase, U.S. Bancorp, and Wells Fargo for portfolio B. 

^We restrict ourselves to consider daily data for two reasons: the first and most important is that 
the industry and regulatory standard is to compute VaR and related risk measures on a daily basis. On 
the other hand, studying lower frequencies (such as weekly or monthly) would considerably decrease the 
size of our samples, possibly invalidating the asymptotic properties of the proposed estimators. 
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Let us provide a few descriptive statistics of the data set. Table [T] displays the 
sample kurtosis for each stock return. Note that all values are (much) larger than 3, 
thus providing (rough) empirical evidence of tail-thickness of the underlying distribution. 
The corresponding adjusted Jarque-Bera test statistic (see e.g. [S]), is reported in the 
last column of Table [T] (p- values are in parentheses) : for each time series the hypothesis 
of an underlying Gaussian distribution is rejected at 1% level. 



[Table 1 about here.] 



7.2 Test portfolios 



For each of the two portfolios, we construct three investment strategies adding to the 
basic linear portfolios containing only the eight underlyings (in equally value-weighted 
proportions) the following positions in options: 

NLL long 10 calls and 5 puts on each asset ("NonLinear Long"); 

NLS short 5 calls and 10 puts on each asset ("NonLinear Short"); 

NLDC short 10 down-and-out calls with barrier equal to 95% of the asset price, and 
short 5 cash-or- nothing put with cash payoff equal to the strike price ( "NonLinear 
Down and Cash"). 

All options are European, at-the-money, and with time to expiration equal to 6 months. 
The nonlinear part of the six test portfolios is synthetic, in the sense that option prices, 
unlike stock prices, are computed on the basis of the information available on the corre- 
sponding underlying and on (a proxy for) the risk-free rate, using Black-Scholes formula 
for standard call and put options, and its variants for barrier and binary optionqfj. Even 
though this procedure is incompatible with the non-Gaussian distributional assump- 
tions we are going to test, this is nonetheless common practice (see e.g. [TB] for a more 
thorough discussion of this issue). 



*We provide formulas for prices and sensitivities of these exotic options in Appendix [B] 
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7.3 Backtesting 

Let us now turn to the analysis of the performance of the four parametric distributions 
for risk factors introduced above, when apphed to the (predictive, i.e. out-of-sample) 
estimation of Value-at-Risk. More precisely, we fit each of the multivariate laws to 
a subset of the time series of stock returns (using a rolling window consisting of 250 
observations), and we estimate the 0.95 and 0.99 quantiles of the distribution of losses 
by simulation, i.e. selecting the corresponding empirical quantiles from a simulated 
sample. In particular, once a random sample from the distribution of X is obtained, we 
translate it into a random sample from the distribution of portfolio losses either by a full 
revaluation of the portfolio value for each sample, or by the usual delta-gamma quadratic 
approximation (see ^. Let \t — £, t] denote the time period over which the parametric 
families of distributions are estimated, where £ stands for the (fixed) length of the rolling 
window. Denoting by VaRj the empirical quantiles of the simulated distribution of losses 
(with risk factors fitted over [t — £, t]), we form the statistic 



it+i = sgn+(Lt_|.i - VaR; 



t), 



for all t G [^, r], where T denotes the length of the time series, Lj stands for the 
observed loss of portfolio value over the period [t — 1, t], and where sgn"*" x = 1 if x > 0, 
and equals zero otherwise. This procedure produces a different set of {^t)e<t<T, for each 
combination of test portfolio, model for risk factors, quantile level (95% and 99%), and 
portfolio revaluation method (full vs. quadratic approximation). 

To assess the accuracy of the VaR estimates, we perform a simple Proportion of 
Failure (PoF) test (cf. j20j). in analogy to the classical likelihood-ratio test. In particular, 
setting 



where (3 G {0.95, 0.99}, 



X 

X :- 



E^*' 



T 

t=£+l 
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one expects C to be asymptotically x^ distributed with one degree of freedom. Therefore, 
the corresponding VaR model can be considered reliable with a 95% confidence level if 
C < Co « 3.84. 

7.3.1 Portfolio A 

The results of the backtesting procedure with full revaluation and with quadratic approx- 
imation for portfolio A are collected in Tables O [3] and Tables IU \E\ respectively, where 
values of x are in the first column, p in the second column, and C in the third column. 
Note that we included, for comparison, VaR estimates obtained under the assumptions 
that risk factors are jointly Gaussian. 

As one may expect, the benchmark Gaussian approach fails at 99% confidence level 
for all three test portfolios. On the other hand, as far as VaR estimates at 95% confidence 
level are concerned, the Gaussian approach is still satisfactory. The same performance is 
displayed by the multivariate t-like approach. The stable-like approach instead is rejected 
by the PoF test only once. We may therefore say that our tests suggest that, between 
the two models constructed by multiplying the marginals of a Gaussian vector by a set 
of independent random variables, the stable-like approach might be preferable. It may 
also be tempting to infer that models with trivial tail dependence cannot adequately 
be used to estimate the probability of large losses of financial portfolios. As we shall 
see below, this conjecture does not seem to be supported by other empirical results. 
Meta-i and meta-stable both perform well, as the corresponding VaR estimates cannot 
be rejected for any one of the test portfolios. Since the estimated values of i^q and 
oq were very large for long portions of the time series, we tested also the performance 
of "degenerate" meta-t and meta-stable models, corresponding to the limiting cases 
1^0 = oo and ao = 2, respectively. As it is well-known, these models correspond to 
certain nonlinear deterministic transformations of Gaussian laws (or, equivalently, to 
laws with t-distributed and a-stable distributed marginals, respectively, and a Gaussian 
copula) . Somewhat surprisingly, these "degenerate" meta-t and meta-stable models give 
very accurate results on our test portfolios. Since these models do have trivial tail 
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dependence, it is impossible to conclude, at least not with the data at hand, that models 
with nontrivial tail dependence should be preferable. In other words, our empirical result 
seem to imply that, for portfolios whose under lyings are not (jointly) strongly dependent, 
the sophistication of models allowing for both heavy tails and tail dependence might not 
be indispensable. 

Completely analogous observations could be made for the estimates of VaR obtained 
by the delta-gamma quadratic approximation of portfolio losses, for which we refer to 
Tables [H and O As in the case of full revaluation, the Gaussian approach performs 
remarkably well at the 95% confidence level. In this respect, it is probably worth recalling 
that obtaining the quantiles of a quadratic form in Gaussian vectors is particularly 
simple and can be done with very little computational effort. In this sense, the classical 
quadratic approximation with Gaussian risk factors could still be regarded as a useful 
tool. 

7.3.2 Portfolio B 

The empirical results in the previous subsection, as already observed, do not offer a 
decisive argument in favor of models featuring both heavy tails and non-trivial tail 
dependence. For this reason it is interesting to perform a back-testing analysis on the 
"stress test" portfolio B, whose underlyings are (presumably) heavy tailed and strongly 
dependent. 

The results with full revaluation and with quadratic approximation for portfolio B 
are collected in Tables [H [7] and Tables [HI [U respectively, where we included again, for 
comparison purposes, VaR estimates obtained under the assumptions that risk factors 
are jointly Gaussian. 

One can see immediately (cf. Tables [6] and [8]) that both the t-like and the stable-like 
models, as well as the standard Gaussian model, are unreliable at the 99% confidence 
level for all portfolios, and even at the 95% confidence level for the test portfolio NLDC 
containing exotic options. This could be interpreted as empirical evidence that these 
classes of models, all of which have trivial tail dependence, are not adequate to estimate 
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the probability of large losses, especially for highly nonlinear portfolios. It does not seem 
possible, however, to assert that the meta-i and meta-stable models, both of which fea- 
ture heavy tails and non-trivial tail dependence, are superior in terms of their empirical 
performance to their degenerate counterparts (i.e. the meta-t and meta-stable models 
with 1/0 = 00 and ao = 2, respectively), which allow heavy tails but no tail dependence. 
In fact (cf. Tables [7] and [9|) , the performance of all meta-t and meta-stable models is 
comparable for the vanilla portfolios NLL and NLS, independently of having non-trivial 
dependence or not (with the exception of the meta-stable model which is rejecte in one 
case, see Table [9|). The picture changes drastically for the exotic portfolio NLDC, for 
which the meta-t, meta-stable and degenerate meta-t models behave poorly. On the 
other hand, surprisingly, the degenerate meta-stable model display a good performance 
at the 99% level. It appears to be very difhcult, if not impossible, to give an explanation 
for this observation. 



[Table 2 about here. 
[Table 3 about here. 



[Table 4 about here, 



[Table 5 about here. 
[Table 6 about here. 
[Table 7 about here. 
[Table 8 about here. 



[Table 9 about here. 



8 Concluding remarks 

Let G ^ N{0,Q), and consider the random vector X obtained from G by variance- 
mixture where 5 is a positive random variable independent of G, that is X := S^'^G. If S 
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has the distribution of the reciprocal of a x^-distributed random variable resp. of an a/2- 
stable subordinator, we obtain the class of multivariate t resp. symmetric a-stable sub- 
Gaussian laws. Plenty of other distributions obtained by variance mixing of a Gaussian 
measure on M that have appeared in the literature for different purposes, including of 
course the modeling of financial risk factors. Similarly, many generalizations of variance 
mixing have appeared in the literature, and it is evident that endless variations on the 
theme are possible. In this article we have limited ourselves to two special cases of 
two possible extensions. Namely, variance mixture models could be generalized setting 
X = T^/'^G, where T is a positive-definite random matrix (cf. [3] for related classes 
of distributions), or one could consider nonlinear images of (the law of) S^'^G, such 
as X = (f){S'^''^G), with </> e.g. a (deterministic) injective function. In particular, if T 
is a diagonal random matrix with independent entries and, for each i = 1, . . . ,(i, Ta 
is distributed like the reciprocal of a x^ random variable with vi degrees of freedom, 
we obtain the class of t-like laws of 33 A completely analogous observation holds for 
the stable-like laws of ^ Similarly, for particular choices of functions 0, we obtain 
meta-elliptical distributions, of which meta-t and meta-stable are just special cases. 

We have considered i-like and stable-like laws because of their simplicity and ease of 
estimation and simulation, whereas the two specific instances of meta-elliptical distri- 
bution have been considered for their seemingly widespread use (at least as regards the 
meta-t law), especially in connection with applications of copula methods. 

Let us mention other possible generalizations of the classical variance-mixture ap- 
proach that have recently appeared in the literature, without any claim of completeness 
(which, as already mentioned above, would not be possible). Assume X = T^''^G, with 
the same notation as above, where T is diagonal, Ta = F^{U) for each i = 1, . . . ,d, U 
is a uniformly distributed random variable independent of G, and and Fi, i = 1, . . . ,d 
are cumulative distribution functions. In the particular case in which each Fi is the 
cumulative distribution function of the reciprocal of a (rescaled) x^"distributed ran- 
dom variable with i/i degrees of freedom, we obtain the class of grouped t-distributions 
(see e.g. [H El E]). Of course nothing prevents us from considering arbitrary distribu- 
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tions functions as -Fj's. By combining this construction with a nonUnear map, so that 
X = (j){T^''^G), one could for instance construct laws with the dependence structure of a 
grouped-t law and with arbitrary marginals (see e.g. [8] for the so-called grouped-i cop- 
ula). The reader clearly understand that the possibilities for constructing multivariate 
laws by any of these methods, or a combination thereof, are endless. 

Our empirical results suggest that, among the infinitely many possible models for 
risk factors with non-trivial tail dependence, both classes of meta-t and meta-stable laws 
offer good performance, at least on reasonably diversified portfolios. Nevertheless, there 
is weaker evidence that, under "extreme" conditions such as those characterizing our 
portfolio B, these classes of models can perform well in highly non-linear situations. 
We believe that the most important message of our paper is that it is indeed worth 
taking into account that "classical" multivariate Gaussian laws with changed marginals 
(in particular a-stable) might perform surprisingly well. Such (relatively) simple models 
are undoubtedly attractive from the viewpoint of practical implementation, as their 
estimation, simulation and quadratic approximation are very easy and "light" in terms 
of computational resources. 

A Densities of some random vectors 

A.l Densities of a class of images of random vectors 

Let (p G C^(IR'^ — )■ M"^) be an injective function of the type 

(j) : {xi,X2,...,Xd) ^ {(j)i{xi),<j)2{x2),---,4>d{xd)), 

for functions (/>fc : M — t- R, A; = 1, . . . , d. 

Proposition 7. Let X be a d-dimensional random vector with density p. Then the 
density of (j){X) is the function 

{yi,...,yk)^ 1 
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Proof. By the multidimensional change of variable formula for Lebesgue integrals and 
by the inverse function theorem we have, for any measurable set A, 



¥{(j){X) eA)= ¥{X e (t>^\A)) = [ p{x) dx 

= [ p{^-Hy))\detD^-\y)\dy 
Ja 

A \<ieiD(i){(t) ^[y))\ 

^(^'^^^^'•••'^^"'y^ dy,...dy,, 

thus proving the claim. D 

A. 2 Densities of sub-Gaussian a-stable vectors 

Let A and G be as in Section HI and set Y = A^i'^G. Let us recall that the law of G 
admits the density 

^^^"^ = (2.)^/^ (detQ)V2 ^"P ( - \^Q~'-^-))- 
Therefore, for any constant c G M, we have 

Icoix) = TTTTT^ TTTTtC" '^ CXD ( {Q~^X,x) ) . 

'^^ ^ (27r)^/2(detQ)i/2 ^\ 2c^^ ') 

For any measurable set B, recalling that A and G axe independent, we have 

P(y £B)= F{a^/^G G B)pA{a) da = / jaQix) dxpA{a) da, 

Jo Jo JB 

where pA denotes the density of the law of A. Therefore the law of Y admits a density 

h{x) = g{{Q~^x,x)), where 

1 1 /"°° 
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In fact g can be extended by continuity at 2; = 0, since the above integral with the 
exponential term suppressed is well-defined, e.g. using the series expansion at zero for 
the density of Sa/2{i, 1,0) of ^ p. 99]. 



B Prices and sensitivities of some exotic options 

Throughout this appendix we place ourselves in a standard Black-Scholes model with 
one "underlying" stock, whose price process is denoted by Sj, < t < T, and whose 
(constant) volatility is denoted by a. The risk-free rate will be denoted by r. We shall 
consider options written on the stock, denoting the exercise time by T, the strike price 
by K, and the barrier by H. 

In the following table we collect the definitions, in terms of their payoff, of some 
barrier and binary options. 



Name 



Payoff 



Down-and-In call 
Down-and-In put 
Down-and-Out call 
Down-and-Out put 
Up-and-In call 
Up-and-In put 
Up-and-Out call 
Up-and-Out put 
Cash-or-Nothing call 1 



max(5T — K,0 
max(i^ — St, 
max(S'T — K,0 
inax{K — St, 
max(5'r — K,0 
inax{K — St, 
max(5T — K,0 
max(i<r — St,0 



if mino<t<T St < H 
if mino<j<T St < H 
if mino<t<T St > H 
if mino<i<T St > H 
if maxo<i<T St > H 
if maxo<t<T St > H 
if maxo<t<T St < H 
if maxo<i<r St < H 
if St > K 
if St < K 



Cash-or-Nothing put 1 

We shall use C^i and Pdi to denote the price (at time zero) of a down-and-in call and 
a down-and-in put, respectively. Completely analogous notation will be used for the 
remaining options, replacing the subscripts accordingly. The price of plain European call 
and put options will be denoted by Cbs and Pbs, respectively. The price at time zero 
of a European call option with strike K and exercise time T, written on an underlying 
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whose price at time zero is 5o, will be denoted by Cbs{So,K,T). The corresponding 

notation will be also used for European put options. 

Setting 

2r r \ 

a^ a'^ 2 

and assuming H < K^ one has (see e.g. [7]), 

Cdi = H Sq Cbs{H Sq ,K,T), 

Pdi = Cdi + KH-'PBs{So,H,T) - {HS^^f^-^HK-^CBs{KHS^\K^H-\T), 

Cui = Cbs 

Pui = H^S^^Pbs{H^S^\K,T). 

By the obvious identities 

Cdi + Cdo = Cbs, Cui + Cuo = Cbs, 

and the corresponding ones for put options (i.e. those obtained replacing C with P), 
we obtain pricing formulas for all barrier options listed in the above table. By the well- 
known formulas for sensitivities of European call and put options, elementary calculus 
yields 

- H^+^S^''~^Abs{H^S^\K,T), 
^ = X(X + l)H^S^^''CBsiH'S^\K,T) 

+ 2{X + 1)H^+^Sq^-''Abs{H^So\K,T) 
+ H^+^So^-^Tbs{H^Sq\K, T), 
^ = H'S^'QBsiH'S^\K,T). 

Similar expressions can be derived for the sensitivities of the other binary options. 

33 



Setting 

^^ ._ log(go/i 

we have (see e.g. 



^^^^log(5o/A-) + (. + .^)r_ ,^^^,^_^^_ 



C„ = e-'"<l.(rf2), P„ = e-'-'"4.(-rf2), 

where $(•) stands for the distribution function of the Gaussian law on M with mean zero 
and unit variance. The sensitivities of binary options are just an exercise in elementary 
calculus. Let us include, for the sake of completeness, the sensitivities of the cash-or- 
nothing put, which is used in our portfolios: 

dPcn _ -e-'^^'{-d2) 

dSo aSoVf ' 

^^-_ -rT^( .X , re-'-^'^'{-d2)log{So/K) r - ^^2 
dT '''' ^ "^^'^ 2aT3/2 aVT ' 
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Figure 1: Plot of the function fp, with p = 1/2. 
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Table 1: Descriptive statistics of financial series 



This tabic reports the sample kurtosis and the adjusted Jarque and Bera tost for the log-returns of the analyzed 
time series. 



American Express 

Apple 

Banco Santander 

Bank of America 

Barclays 

Chevron 

Conoco 

Citigroup 

Johnson & Johnson 

JPM Chase 

Microsoft 

Pfitzer 

U.S. Bancorp 

Wells Fargo 



"Curtosi: 
9.084 


57.786 


10.040 


25.918 


15.267 


13.491 


8.373 


38.374 


9.735 


11.255 


8.186 


5.914 


22.514 


20.869 



Adj. J&B 

7066 
(0.00) 
573091 
(0.00) 

9452 
(0.00) 
99815 
(0.00) 
28907 
(0.00) 
20904 
(0.00) 

5529 
(0.00) 
237526 
(0.00) 

8627 
(0.00) 
12946 
(0.00) 

5105 
(0.00) 

1632 
(0.00) 
72736 
(0.00) 
60866 
(0.00) 
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Table 2: Value-at-Risk backtesting (full revaluation) 



This table reports the results of a Value-at-Risk backtesting 
(portfolio A). Panels A and B report the results for the lonj 
Panel C reports the results for the down-and-out and cash-or- 
asterisk indicate that the corresponding model is not reliable. 



with full revaluation of the first portfolio 
j; and short portfolios, respectively, while 
nothing portfolio. Values marked with an 



Panel A: NLL 



t-like95% 
t-likegg% 
Stable- likeg5% 
Stable- likeggjj 
Gaussian95% 
Gaussiang9% 



Violations 
204 

64 
205 

66 
191 

61 



Percentage 

4.76% 
1.49% 
4.78% 
1.54% 
4.45% 
1.42% 



LR 
0.54 
9.13* 
0.44 
10.81* 
2.79 
6.84* 



Panel B: NLS 



f-like95% 
t-like99% 
Stable- like95% 
Stable- like99% 
Gaussian95% 
Gaussian99% 



Violations 
225 

64 
223 

49 
207 

63 



Percentage 
5.25% 
1.49% 
5.20% 
1.14% 
4.83% 
1.47% 



LR 

0.54 

9.13* 

0.36 

0.84 

0.27 

8.33* 



Panel C: NLDC 



f-like95% 
i-like99% 
Stable- likeg5% 
Stable- like9g% 
Gaussian95% 
Gaussian99% 



Violations 
201 

59 
207 

47 
212 

65 



Percentage 

4.68% 
1.35% 
4.83% 
1.10% 
4.94% 
1.52% 



LR 
0.90 
5.48* 
0.27 
0.39 
0.28 
9.95* 
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Table 3: Value-at-Risk backtesting (full revaluation) 

This table is a continuation of Table [2] The same notation is used here. 

Panel A: NLL 



Meta-i95% 
Meta-t99% 
Meta-stableg5% 
Meta-stableg9% 
Meta.-tg5% {vo = oo) 
Meta-tg9% {vo = oo) 
Meta-stable95% (ao 
Meta-stable99% (ao 



2) 
2) 



Violations 
200 

46 
223 

48 
200 

52 
197 

51 



Percentage 
4.66% 
1.07% 
5.20% 
1.12% 
4.66% 
1.21% 
4.59% 
1.19% 



LR 
1.04 
0.22 
0.36 
0.59 
1.04 
1.83 
1.53 
1.46 



Panel B: NLS 



Meta-i9g% 
Meta-i99% 
Meta-stableg5% 
Meta-stablegg^ 
Meta-ig5% {uo = oo) 
Meta-igg% {fo = oo) 
Meta-stablegg% (ao — 2) 
Meta-stableg9% (ao = 2) 



Violations 
217 

48 
233 

40 
210 

50 
217 

39 



Percentage 
5.06% 
1.12% 
5.43% 
0.93% 
4.90% 
1.17% 
5.06% 
0.91% 



LR 
0.03 
0.59 
1.65 
0.20 
0.10 
1.13 
0.03 
0.37 



Panel C: NLDC 



Meta-i95% 
Meta-t99% 
Meta-stableg5% 
Meta-stablegg% 
Meta-ig5% {uq = oo) 
Meta-tgg% {vo = oo) 
Meta-stableg5% (ao = 2) 
Meta-stablegg% (ao — 2) 



Violations 
219 

48 
224 

39 
220 

53 
208 

36 



Percentage 
5.11% 
1.12% 
5.22% 
0.91% 
5.13% 
1.24% 
4.85% 
0.84% 



LR 
0.10 
0.59 
0.45 
0.37 
0.15 
2.24 
0.20 
1.18 
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Table 4: Value-at-Risk backtesting (quadratic approximation) 

This table reports the resuhs of a Value-at-Risk backtesting with quadratic approximation of the first 
portfolio (portfolio A). Panels A and B report the results for the long and short portfolios, respectively, 
while Panel C reports the results for the down-and-out and cash-or-nothing portfolio. Values marked 
with an asterisk indicate that the corresponding model is not reliable. 



Panel A: NLL 



i-like95% 
t-like99% 
Stable- like95% 
Stable- like99% 
Gaussian95% 
Gaussian99% 



Violations 
203 

63 
204 

65 
188 

61 



Percentage 

4.73% 
1.47% 
4.75% 
1.52% 
4.38% 
1.42% 



LR 
0.65 
8.33* 
0.54 
9.95* 
3.56 
6.84* 



Panel B: NLS 



i-like95% 
i-like99% 
Stable- like95% 
Stable- like99% 
Gaussian95% 
Gaussian99% 



Violations 
228 

66 
232 

49 
215 

63 



Percentage 
5.32% 
1.54% 
5.41% 
1.14% 
5.01% 
1.47% 



LR 
0.89 
10.81* 
1.48 
0.84 
0.00 
8.33* 



Panel C: NLDC 



i-like95% 
i-like99% 
Stable- like95% 
Stable- like99% 
Gaussian95% 
Gaussian99% 



Violations 
185 

58 
204 

45 
208 

64 



Percentage 
4.31% 
1.35% 
4.76% 
1.05% 
4.85% 
1.49% 



LR 
4.44* 
4.85* 
0.54 
0.10 
0.20 
9.13* 
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Table 5: Value-at-Risk backtesting (quadratic approximation) 

This table is a continuation of Table |4] The same notation is used here. 

Panel A: NLL 



Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 



^95% 
^99% 

stableg5% 
stableg9% 

^95% (j'O = 
^99% (t'O = 

stable95% 
stable99% 



ooj 
oo) 



(ao = 2) 
(ao = 2) 



Violations 
200 

44 
219 

45 
200 

52 
193 

50 



Percentage 
4.66% 
1.03% 
5.11% 
1.05% 
4.66% 
1.21% 
4.50% 
1.17% 



LR 
1.04 
0.03 
0.10 
0.10 
1.04 
1.83 
2.32 
1.13 



Panel B: NLS 



Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 



^95% 
^99% 

stable95% 
stable99% 

^95% {l^O 

h9% (i^o = 
stable95% 
stable99% 



= oo) 

= oo) 
(ao = 2) 
(ao = 2) 



Violations 
225 

50 
236 

42 
220 

52 
227 

44 



Percentage 
5.25% 
1.17% 
5.50% 
0.98% 
5.13% 
1.21% 
5.29% 
1.03% 



LR 
0.54 
1.13 
2.22 
0.02 
0.15 
1.83 
0.77 
0.03 



Panel C: NLDC 



Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 



^95% 



stable95% 
stable99% 

^95% (t'O = 
^99% (t'O = 

stable95% 
stable99% 



(ao = 2) 

{ao = 2) 



Violations 
216 

46 
223 

41 
212 

49 
200 

35 



Percentage 
5.04% 
1.07% 
5.20% 
0.96% 
4.94% 
1.14% 
4.66% 
0.82% 



LR 
0.01 
0.22 
0.36 
0.08 
0.03 
0.84 
1.04 
1.56 
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Table 6: Value-at-Risk backtesting (full revaluation) 



This table reports the results 
(portfolio B). Panels A and B 
C reports the results for the 



of a Value-at-Risk backtesting with full revaluation of the second portfolio 
report the results for the long and short portfolios, respectively, while Panel 
down-and-out and cash-or-nothing portfolio. Values marked with an aster 



Panel A: NLL 



t-like95% 
f-like9g% 
Stable- Iikeg5% 
Stable- like99% 
Gaussian95% 
Gaussian99% 



Violations 
217 

72 
232 

88 
208 

70 



Percentage 
5.06% 
1.68% 
5.41% 
2.05% 
4.85% 
1.63% 



LR 

0.03 
16.59* 

1.48 
36.77* 

0.20 
14.55* 



Panel B: NLS 



i-like95% 
i-like99% 
Stable- like95% 
Stable- like99% 
Gaussian95% 
Gaussian99% 



Violations 
217 

59 
230 
507 
203 

64 



Percentage 
5.06% 
1.37% 
5.36% 
1.33% 
4.73% 
1.63% 



LR 
0.03 
5.48* 
1.17 
4.26* 
0.65 
9.13* 



Panel C: NLDC 



i-like95% 
i-like99% 
Stable- like95% 
Stable- like99% 
Gaussian95% 
Gaussian99% 



Violations 
253 

78 
250 

68 
250 

85 



Percentage 
5.90% 
1.82% 
5.83% 
1.59% 
5.83% 
1.98% 



LR 
6.93* 
23.37* 
5.92* 
12.62* 
5.92* 
32.50* 
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Table 7: Value-at-Risk backtesting (full revaluation) 

This table is a continuation of Table [6] The same notation is used here. 

Panel A: NLL 



Meta-i95% 
Meta-t99% 
Meta-stableg5% 
Meta-stableg9% 
Meta.-tg5% {vo = oo) 
Meta-tg9% {vo = oo) 
Meta-stable95% (ao 
Meta-stable99% (ao 



2) 
2) 



Violations 
227 

51 
236 

45 
223 

54 
221 

46 



Percentage 
5.50% 
1.05% 
5.17% 
0.96% 
5.20% 
1.26% 
5.15% 
1.07% 



LR 

2.22 
0.10 
0.28 
0.08 
0.36 
2.69 
0.21 
0.22 



Panel B: NLS 



Meta-i9g% 
Meta-i99% 
Meta-stableg5% 
Meta-stablegg^ 
Meta-ig5% {uo = oo) 
Meta-igg% {fo = oo) 
Meta-stablegg% (ao — 2) 
Meta-stableg9% (ao = 2) 



Violations 
223 

44 
241 

41 
219 

49 
223 

44 



Percentage 
5.20% 
1.03% 
5.62% 
0.99% 
5.10% 
1.14% 
5.20% 
1.03% 



LR 

0.36 
0.03 
3.34 
0.08 
0.10 
0.84 
0.36 
0.03 



Panel C: NLDC 



Meta-i95% 
Meta-t99% 
Meta-stableg5% 
Meta-stablegg% 
Meta-ig5% {uq = oo) 
Meta-t99% (vo = oo) 
Meta-stable95% (ao = 2) 
Meta-stable99% (qq = 2) 



Violations 
259 

68 
266 

59 
255 

69 
252 

49 



Percentage 
6.04% 
1.59% 
6.20% 
1.38% 
5.95% 
1.61% 
5.88% 
1.14% 



LR 

9.18* 
12.62* 
12.18* 
5.48* 
7.65* 
13.57* 
6.59* 
0.84 
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Table 8: Value-at-Risk backtesting (quadratic approximation) 

This table reports the results of a Value-at-Risk backtesting with quadratic approximation of the second 
portfolio (portfolio B). Panels A and B report the results for the long and short portfolios, respectively, 
while Panel C reports the results for the down-and-out and cash-or-nothing portfolio. Values marked 
with an asterisque indicate that the corresponding model is not reliable. 



Panel A: NLL 



i-like95% 
t-like99% 
Stable- like95% 
Stable- like99% 
Gaussian95% 
Gaussian99% 



Violations 
214 

71 
229 

84 
207 



Percentage 
4.99% 
1.66% 
5.33% 
1.96% 
4.83% 
1.61% 



LR 

0.00 
15.55* 

1.02 
31.12* 

0.27 
13.57* 



Panel B: NLS 



i-like95% 

i-like99% 

Stable-like95% 

Stable-like99% 

Gaussian95% 

Gaussian99% 



Violations 
221 

65 
242 

57 
206 



Percentage 

5.15% 
1.52% 
5.64% 
1.33% 
4.80% 
1.59% 



LR 
0.21 
9.95* 
3.60 
4.26* 
0.35 
12.62* 



Panel C: NLDC 



i-like95% 
i-like99% 
Stable- like95% 
Stable- like99% 
Gaussian95% 
Gaussian99% 



Violations 
244 

76 
243 

65 
243 

83 



Percentage 
5.69% 
1.77% 
5.67% 
1.52% 
5.87% 
1.94% 



LR 
4.13* 
21.01* 
3.86* 
9.95* 
3.86* 
29.77* 
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Table 9: Value-at-Risk backtesting (quadratic approximation) 

This table is a continuation of Table [8] The same notation is used here. 

Panel A: NLL 



Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 



^95% 
^99% 

stableg5% 
stableg9% 

^95% (j'O = 
^99% (t'O = 

stable95% 
stable99% 



ooj 
oo) 



(ao = 2) 
(ao = 2) 



Violations 
224 

51 
233 

45 
220 

53 
221 

44 



Percentage 
5.22% 
1.19% 
5.43% 
1.05% 
5.13% 
1.24% 
5.15% 
1.03% 



LR 

0.45 
1.46 
1.65 
0.10 
0.15 
2.24 
0.21 
0.03 



Panel B: NLS 



Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 



^95% 
^99% 

stable95% 
stable99% 

^95% {l^O 

h9% (i^o = 
stable95% 
stable99% 



= oo) 

= oo) 
(ao = 2) 
(ao = 2) 



Violations 
228 

48 
249 

45 
227 

54 
233 

45 



Percentage 
5.32% 
1.12% 
5.80% 
1.05% 
5.29% 
1.26% 
5.43% 
1.05% 



LR 

0.89 
0.59 
5.60* 
0.10 
0.77 
2.69 
1.65 
0.10 



Panel C: NLDC 



Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 
Meta- 



^95% 



stable95% 
stable99% 

^95% (t'O = 
^99% {l^O = 

stable95% 
stable99% 



: OO) 

= oo) 
(ao = 2) 

(«o = 2) 



Violations 
253 

66 
262 

56 
250 

66 
244 

46 



Percentage 
5.90% 
1.54% 
6.11% 
1.31% 
5.83% 
1.54% 
5.69% 
1.07% 



LR 

6.93* 
10.81* 
10.42* 
3.70 
5.92* 
10.81* 
4.13* 
0.22 
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